PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID BGIOSGA004593-PA
Common NameOsI_04032
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; Liliopsida; Petrosaviidae; commelinids; Poales; Poaceae; BOP clade; Oryzoideae; Oryzeae; Oryzinae; Oryza; Oryza sativa
Family HD-ZIP
Protein Properties Length: 736aa    MW: 81049.5 Da    PI: 7.6639
Description HD-ZIP family protein
Gene Model
Gene Model ID Type Source Coding Sequence
BGIOSGA004593-PAgenomeRISView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Homeobox42.21.4e-1368122256
                       T--SS--HHHHHHHHHHHHHSSS--HHHHHHHHHHCTS-HHHHHHHHHHHHHHHH CS
          Homeobox   2 rkRttftkeqleeLeelFeknrypsaeereeLAkklgLterqVkvWFqNrRakek 56 
                       r+ +++t +q e+Le +F  + +p+  ++++L++ +gL  +qVk+WFqN+R++ k
  BGIOSGA004593-PA  68 RRLQRLTGKQSEVLEGFFSICGHPDDGQKRHLSETTGLGLDQVKFWFQNKRTQVK 122
                       445789*********************************************9877 PP

2START87.62.7e-282224382205
                       HHHHHHHHHHHHHHHC-TT-EEEE....EXCCTTEEEEEEESSS.......SCEEEEEEEECCSCHHHHHHHHHCCCGGCT-TT-S...EEEE CS
             START   2 laeeaaqelvkkalaeepgWvkss....esengdevlqkfeeskv......dsgealrasgvvdmvlallveellddkeqWdetla...kaet 81 
                       la  a+ +l+ +a++  ++W+ ++    e +n++  + +  ++++      +++ea ra+ +v m+   +v  l+d    + + ++   + ++
  BGIOSGA004593-PA 222 LAKNAMHALIILAESHVALWFPVPgcayEVLNKMMAYDQAYPGDNsanaigFKTEATRAVSMVMMDYKSVVDFLMDPY-NYRTFFPeviSGAV 313
                       678899999999************999955555555555555555566699999************996665555555.55555554449999 PP

                       EEEECTT.......EEEEEEEEXXTTXX-SSX.EEEEEEEEEEE.TTS-EEEEEEEEE-TTS--.-TTSEE-EESSEEEEEEEECTCEEEEEE CS
             START  82 levissg.......galqlmvaelqalsplvp.RdfvfvRyirqlgagdwvivdvSvdseqkppesssvvRaellpSgiliepksnghskvtw 166
                       +++i +        g++qlm+ e++++splvp R+ +f+Ry+  l++g  v+ dvS+d  +  +      ++++ pSg+li++   + +kvt 
  BGIOSGA004593-PA 314 TNRIYTWptsdgynGVIQLMTVEMMFPSPLVPaRKCTFLRYCNVLNEGLVVVIDVSLDDGSIFS------KCRKMPSGFLIQSIRPNSCKVTA 400
                       999977777789*********************************************9988866......7********************** PP

                       EE-EE--SSXXHHHHHHHHHHHHHHHHHHHHHHTXXXXX CS
             START 167 vehvdlkgrlphwllrslvksglaegaktwvatlqrqce 205
                       +ehv  ++  +h+l+++ ++ gl++ga++wvat+ rq +
  BGIOSGA004593-PA 401 IEHVLADDTGVHELYQPCMN-GLVFGARRWVATMARQSA 438
                       *****************985.89***********99876 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
Gene3DG3DSA:1.10.10.603.8E-1459118IPR009057Homeodomain-like
SuperFamilySSF466891.88E-1559126IPR009057Homeodomain-like
PROSITE profilePS5007114.26864124IPR001356Homeobox domain
CDDcd000861.62E-1265124No hitNo description
SMARTSM003893.5E-1066128IPR001356Homeobox domain
PfamPF000463.5E-1168122IPR001356Homeobox domain
PROSITE patternPS00027099122IPR017970Homeobox, conserved site
PROSITE profilePS5084818.355212442IPR002913START domain
CDDcd088751.14E-92216438No hitNo description
SMARTSM002347.2E-6221439IPR002913START domain
PfamPF018521.6E-21222438IPR002913START domain
SuperFamilySSF559613.85E-20272437No hitNo description
SuperFamilySSF559611.19E-8456664No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0006355Biological Processregulation of transcription, DNA-templated
GO:0005634Cellular Componentnucleus
GO:0008289Molecular Functionlipid binding
GO:0043565Molecular Functionsequence-specific DNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 736 aa     Download sequence    Send to blast
MEMNQQHNEG NESFVALMNG VAGDGTATLP NDGEQSMSIP ARELFAAIEA DSGLLPVNSS  60
NTNEKRKRRL QRLTGKQSEV LEGFFSICGH PDDGQKRHLS ETTGLGLDQV KFWFQNKRTQ  120
VKTMCWKEEN YKLSVENEIL RDENRRVKIA HCTAICLTCR NSSVQNQLAV EMERLMGQSE  180
WLQQEIARSN GTPPAANLAF QLNSSADYVF SGQHDQQMIA ELAKNAMHAL IILAESHVAL  240
WFPVPGCAYE VLNKMMAYDQ AYPGDNSANA IGFKTEATRA VSMVMMDYKS VVDFLMDPYN  300
YRTFFPEVIS GAVTNRIYTW PTSDGYNGVI QLMTVEMMFP SPLVPARKCT FLRYCNVLNE  360
GLVVVIDVSL DDGSIFSKCR KMPSGFLIQS IRPNSCKVTA IEHVLADDTG VHELYQPCMN  420
GLVFGARRWV ATMARQSARM RDVHHNKTAP QVSTKGRKNL MKLADDLLAS FAGGITATGG  480
GTWTVVIGAG TEKDIRVAYR RTTEGSSSYN AILSVSASLR LPLPMRKTFD LLRNLTHRCK  540
WDVLVHGSVV KEEVTIARGV GNDDTVTVLH CKRAGREDRG RTMILQNNGY DASGSFMVYS  600
QIDSELMNTM VLSPSDLPPG RGGPSLYPTG FSLLPDVEAA QDSSGIALGE VGGTLMTMGF  660
QIPVKLASGD RMYSRSAASA IRLMTDTIAL VKKTLMNEHS GIYGGEKGGP CADKNQEHAK  720
GIHRLRSPCI SQNLLN
Expression -- UniGene ? help Back to Top
UniGene ID E-value Expressed in
Os.183900.0flower| panicle| seed| stem
Expression -- Microarray ? help Back to Top
Source ID E-value
GEO220231560.0
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Nucleotide ? help Back to Top
Source Hit ID E-value Description
GenBankAF3178820.0AF317882.1 Oryza sativa transcription factor 1 (TF1) mRNA, complete cds.
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_015632244.10.0PREDICTED: homeobox-leucine zipper protein TF1
SwissprotQ5ZAY00.0TF1_ORYSJ; Homeobox-leucine zipper protein TF1
TrEMBLA2WVW00.0A2WVW0_ORYSI; Putative uncharacterized protein
STRINGBGIOSGA004593-PA0.0(Oryza sativa Indica Group)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
MonocotsOGMP104752833
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT4G21750.21e-101HD-ZIP family protein